Canonical fitness model for simple scale-free graphs 
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We consider a fitness model assumed to generate simple graphs with power-law heavy-tailed degree 
sequence: P(k) <x with < a < 1, in which the corresponding distributions do not posses a 

mean. We discuss the situations in which the model is used to produce a multigraph and examine 
what happens if the multiple edges are merged to a single one and thus a simple graph is built. We 
give the relation between the (normalized) fitness parameter r and the expected degree v of a node 
and show analytically that it possesses non-trivial intermediate and final asymptotic behaviors. We 
show that the model produces P(k) tx k~ 2 for large values of k independent of a. Our analytical 
findings are confirmed by numerical simulations. 



I. INTRODUCTION 

Large systems of objects are often described in terms 
of networks, i.e. are represented by abstract graphs com- 
posed of vertices (objects) and edges (links, interconnec- 
tions) [1-6]. The number of vertices is called the order 
and the number of edges the size of the graph [7]. The 
number k of incident edges of a given vertex is called its 
degree, and the probability distribution P{k) the degree 
distribution of the graph. A special feature of many real 
world networks is that the probability P(k) oc A; -7 with 
some 7 > 1, i.e. these graphs show scale- free nature. 

The two classical models of Gilbert [8] and Erdos-Renyi 
[9] do not show this property. The first one is also known 
by the term Bernoulli graph [8, 10]. In this model which 
is usually denoted by Gm, p , one starts with a set of M 
labeled nodes. Each possible pair of vertices is linked 
with an edge with probability p. This leads to a sim- 
ple graph with no one-loops. The other random graph 
model was introduced by Erdos and Renyi [9] as a sta- 
tistical ensemble Gm,n that contains all possible graphs 
with M nodes and N links (without multiple edges and 
one- loops) , and whose members are all of equal statistical 
weight. Austin et al. [11] however suggested that multi- 
ple edges should also be permitted. These two classical 
models of Gilbert [8] and Austin et al. [11] correspond 
to canonical and grand canonical ensembles, respectively 
[12], with respect to the number of edges. When refer- 
ring to the classical canonical model, we mean that both 
multiple edges and one-loops are allowed (see p. 9 of 
[12]), i.e. it generates a multigraph. The difference to 
the grand canonical model is that the number N of links 
(i.e. the network's size) is fixed and is the same for every 
realization of Gm,n- 

In both classical models the degree distribution P(k) 
tends to a Poissonian P(k) ~ e~ x \ k /k\ in the limit 
N — > oo, where A = (k) denotes the mathematical ex- 
pectation of the vertex's degree. Therefore these models 
cannot mimic the scale-free behavior of many real- world 
networks. 

The scale-free behavior can be reproduced by consider- 
ing growing networks, for example it can be generated by 
preferential attachments algorithms [13], or by modifying 



classical approaches which leads to fitness models. 

Thus, Caldarelli et al. [14] suggested a fitness model 
where each vertex is assigned a fitness parameter. For 
each possible connection between two vertices the prob- 
ability that the connection exits is given by a linking 
function of the two fitnesses. This corresponds to a gen- 
eralization of Gilbert's model. In the present work, we 
examine a certain graph generation model equivalent to 
the model suggested by Goh et al. [15] which is a gen- 
eralization of Austin et al's model. The model can also 
be considered as a special case of Caldarelli et al. [14]. 
The authors of [15] assumed it as a trivial statement 
that the degree distribution follows the fitness distribu- 
tion in its asymptotics. As we proceed to show, the 
approach of [15] does not work for fitness distributions 
lacking the mean. In what follows we discuss this sit- 
uation in detail and show that for heavy-tailed fitness 
distributions Pf(f) oc with < a < 1, the de- 

gree distributions P(k) oc fc~ 2 are universally produced 
independent of a. The model is therefore not applica- 
ble to mimic such networks as the network of word co- 
occurrence (a — 0.8, [16, 17]) and the network of homo- 
sexual contacts (a = 0.6, [18]). 



II. FITNESS MODELS 

As we pointed out above, both classical models can be 
altered in such a way that each vertex i is assigned a 
positive random value fi which influences its chance to 
attract links in the building process. These values are 
called the (intrinsic) fitness parameters. For each vertex 
its fitness parameter is chosen randomly according to a 
given probability density function (pdf) Pf{f). Thus, 
the one-step procedure of building the classical models is 
extended to a two-step procedure in which both fitness 
parameters and degrees are determined randomly. 

For the grand canonical model this generalization was 
investigated in [14]: For every pair ij a link is drawn with 
a linking probability F(fi,fj) where F is a symmetric 
function in its arguments. A natural choice for the link- 
ing probability is F(fi,fj) = {fifj)/fl[ where f M is the 
largest value of the /'s in the considered network. Usu- 
ally the distribution p/(f) relates to the intended degree 
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distribution P(k). 

In our case we want to generate networks with degree 
distribution P(k) oc /c _1_ " with < a < 1, i.e. the ones 
lacking the mean. We attempt to achieve this by choos- 
ing pf(f) oc J -1- ". The problem arises when we want 
to target a given number of edges because the expected 
number of edges i . fifj/f M is a strongly fluctuating 
quantity, which is not the case for a > 1. Hence, in 
the absence of the mean degree, this model provides only 
small control over the networks size which urges us to 
concentrate on the canonical fitness model. 

The Canonical Fitness Model starts again with M 
vertices that have each an intrinsic fitness parameter dis- 
tributed according to the pdf pf. We choose the proba- 
bility F(fi, fj) that a given link is connected to the nodes 
i and j to be 

F(fnfj) = — j— 2- 
\l^k fa) 

At first, we will generate multigraphs and explain later 
how this model can be modified to produce simple graphs. 

It can be easily seen that for each vertex i with fit- 
ness parameter fi the degree expectation value (fcj) = 
Aj = 2Nfi/^2j fj- This justifies the introduction of the 
normalized fitness parameters rf. 
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so that Aj = 2Nri. Thus, the distribution of degree ex- 
pectations pa (A) = p r (X/2N)/2N has the same shape as 
the distribution of normalized fitnesses p r (r). For a > 1 
the distribution of n and therefore of Aj follows that of 
/i, but in the case of heavy-tailed distributions a < 1 it 
differs in shape from Pf(f) [19]. In what follows we con- 
centrate on the behavior of p r (r) which gives the whole 
relevant information on the degree distribution. 

Note, that even for narrow distributions of fitnesses 
the degree distribution differs from the fitness distri- 
bution. It becomes obvious when considering the case 
Pf(f) oc S(f — f*) with some positive value /*. Here both 
models are reduced to their respective classical model 
with its Poisson-shaped degree distribution. Thus, we 
understand that for each vertex i with the degree expec- 
tation Xi, the probability P»(fe), that its actual degree 
is k, is: Pj(fc) = \^e~ Xi /k\ Hence, the overall degree 
distribution P(k) = ^e~ A p A (A)dA. We call this the 
intrinsic Poisson behavior. Although the effect of this be- 
havior on the resulting actual degree distribution persists 
for slowly decaying p r , the asymptotic behaviors coincide 
[20]. 

Now, in the case of heavy-tailed distributions the ran- 
dom variable r (which represents the r,) can be expressed 
as a function of a random variable R which is a quotient 
of two independent one-sided Levy variables with expo- 
nent a [20]: 
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1 + xR' 



(1) 



with x = (M- 1) 1 /" « M x l a . The pdf p R of the variable 
R is known to fulfill the following relation to the Mittag- 
Leffler function E a (z) = J2n=o z n /T(an + 1) [21]: 



OO 

/■ 



e- uH p R (R)dR = E a (-u a ). 



(2) 



The full derivation of Eq. (2) is given in the appendix 
of [20] which is based on the analytic methods of Chap. 
XIV in [22]. Since Eq. (2) displays a Laplace transform 
of pa, the asymptotic behavior of pr for large values of 
R can be obtained via Corollary 8.1.7 of [23] which is an 
extension of the Tauberian theorems: From 



l-E a (-u a ) 



r(a + l) 



for u — > 



it follows that 



r ( «£a-«) fOT (3) 

Because R is a quotient of two identically distributed 
non-negative random variables, the distributions of R 
and 1/R are the same. Since 

p R {R) = p 1/R (R) = -^p R ( ' 



T(a)T(l -a) 
we have 



R 2in \R ' 



PR [-R - 



R 



l-Q 



R) T(a)T(l-a) 



for R — ?• oo. 



Thus, we can also determine the behavior of pr{R) for 
small R: 



pr{R) 



R 



a-l 



T(a)T(l - a) 
Using the connection between p r and pr: 



for R -> 0. (4) 



1 



p r {r) = -^PR - 



xr^ 



- 1 



(5) 



we can estimate the behavior of p r for small and large 
values of r. For r — > with 1/r — > oo, we can use Eq. 
(3): 

p r {r) ~ — — i-i- for r^0; (6) 

r(a)r(l — a) 

for r -> 1, i.e. (1/r) — 1 -> 0, we use Eq. (4): 



-1— a / 1 /K .\a — 1 



(l-r-r 



MT(a)r(l-a) 



for r -> 1. (7) 



For r < 1 this behaves as p r (r) ~ r^ a /(Mr(tt)r(l - 
a)). 
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On the total p r shows the following regimes: 



and performing the integration leads in the lowest non- 
vanishing order to: 



Pr(r) ^ < 



TO forr<M- 1 /«, ( 8 a) 
Mrwrd-a) forM-V«< r «l, (gb) 

(8c) 



oo , . -.n 

M ^ n\ K ' 

n=0 

«1-(1-A<r». 
The first moment of r is given by: 



r -i-« (1 _ rr -i 

MT(a)r(l-a) 



The transition between the first two regimes takes place 
at r* = 1/x = M -1 /". The pdf p\(X) of degree expecta- 
tion values follows: 



Pa(A) = ± Pr (A 



III. SIMPLE GRAPHS 

Now we use this model to generate simple graphs. One 
possibility to achieve this is to start with the ordinary 
canonical fitness model and merge the multiple edges to 
a single one as it is done in [15]. The probability pij that 
there is an edge between any two vertices i and j is due 
to Poissonian statistics: 



Pij 



= l-e~ 2Nrir '. 



(9) 



Note that Eq. (9) can also be used to define the linking 
probability F(fi, fj) = in the grand canonical model. 

Merging the edges changes the degree distribution of 
a graph. We first examine the degree expectation values 
Vj = (k^) of vertices j of this simple graph. 



A. Relation between Fitness and Degree 
Expectation 

In a network of M vertices and N thrown edges, the 
expected degree Vj of a node j with normalized fitness rj 
is: 

Vj = (1 - e~ 2Nr * ri ) = M ^1 — J e- 2Nr > r Pr (r)dr^ . 

Let Xj again denote 27V Tj and consider v as a function 
of A: 

l 

^ = 1 - / ^ r Pr(r)dr = M (l - (e"^)) . (10) 



Here the two cases A <c 1 and A ^> 1 have to be con- 
sidered separately. 

For A <C 1, the argument of the exponential exp[— Xr] 
is small. Expanding the exponential into a Taylor series 



(11) 
(12) 



OO OO 

(r) = J r(R) PR (R)dR = J ^J-^p^RJdR. (13) 



It was evaluated in [20] by using: 



OO 

_J_ = i [ e u/x e -ul 

1 + xR x J 



du. 



(14) 



For the sake of completeness we will recall the calculation 
here: Substituting Eq. (14) into Eq. (13) and changing 
the order of integration yields: 



OO OO 



(r) 



\ J CU/X I e ~ UR P R ^ dRdu 







= ly e u/x E a {-u a )du. 
o 

The last integral is the Laplace transform of the Mittag- 
Leffier function which is given by C{E a (— u a )} = 
s a-i/( s « + i) [22]. Setting s = 1/x, we find: 



(r) 



1 1 



(15) 



x a + 1 M 

Thus, for A < 1, we have found out that v(X) = A, i.e. 

v{r) cx 2Nr (A < 1). (16) 

For A ^> 1, the expansion of the exponential into a 
power series as in Eq. (11) is not eligible since the series 
in Eq. (11) converges slowly if at all. Hence, for the cal- 
culation of (exp (—Xr)}, we resort to directly substituting 
Eq. (1) into Eq. (10): 

1 OO 

^ = 1 - j e- Xr p r (r)dr = 1- J p R {R)dR. 

o 

(17) 



Now we remember that u> = 1/R has the same pdf as R, 
and we write Eq. (17) as: 



oc 



e x +» p R (w)du). 



(18) 
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FIG. 2. Distribution of the averaged degrees v for a = 0.5. The 
order of the networks is M = 10 3 and the size N = 10 5 . The empty 
circles are the simulation results obtained by binning the data. The 
dashed line shows the asymptotic v~ 2 -behavior. The dotted line 
represents the intermediate asymptotic behavior pi/(u) <x v~ 1— a . 



FIG. 1. Expected degree v of a vertex in dependence on its nor- 
malized fitness parameter r. The order of the networks is M = 10 3 
and the size N = 10 s , the Levy-exponent is a = 0.5. The sim- 
ulated data are shown by gray dots. The dashed line shows the 
approximation of Eq. (22a): u(X) = A, which holds for small A. 
For large values of A, z^(A) behaves as the solid line indicates, see 
Eq. (22b). 



For large values of A the behavior of v(\) is determined 
by the behavior of the integrand at small values of u>, 
where the Eq. (18) can be approximated by: 



M ~ 1 



(19) 



The error of this approximation is of the order of 1/A. 
Evaluating the integral in Eq. (19) and using Eq. (2) we 
get: 



— = 1 - E n 
M 



1 - E a \ 

1 M 



(20) 
(21) 



The asymptotes of Eq. (16) and Eq. (21) intersect at 
A* w [r(a + l)] 1/(Q - 1} . 
Overall, we find: 



2Nr 

M (l — E„ 



for r <C 



(2jVr) c 
M 



27V' 
else. 



(22a) 
(22b) 



with the Levy-exponent a = 1/2. In the simulations we 
first generate a realization of the fitness distribution and 
calculate the mean degrees of the vertices by averaging 
over 100 realizations of the network. This is repeated for 
100 fitness realizations following from the same fitness 
distribution. For small values of A the averaged degrees 
behave as v(X) = A. 

For 1 < A C i we use the expansion of E a for small 



values of the argument v/M 



Mr(a+1) ' 



thus: 



(2Nr) a „ A* M 1 / 

r&TT) for m <<r<< ^N 



(23) 



B. Distribution of Degree Expectations 

Using the dependence v(r) as given by Eqs. (22a) and 
(22b), we can calculate the distribution p v {v) of degree 
expectations in the considered network: 



dr{v) 



Av 



Pr {r{v)) ■ 



(24) 



Eqs. (22a) and (22b) yield: 



r ~ < 



2N 

prp+i)) 1 /" 

2N 

(-MS-i(l-ft)) 1 
2JV 



for v -c A*, (25a) 

forA *« J '«^ ( 25b ) 
else, (25c) 



In Fig. 1 this is compared to simulation results obtained 
by studying networks of order M = 10 3 and size N = 10 5 , 



where we have singled out explicitly the intermediate 
regime Eq. (25b). Together with Eqs. (8a) to (8c), this 
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gives the distribution of degree expectation values p v . 
We assume that the considered networks are sparse, i.e. 
p = 2N/M 2 < 1 and hence the condition of Eq. (22b) 
or Eq. (25c), respectively, does not apply. Moreover, we 
concentrate on the values of r in M~ x / a < r -C 1 where 
p r {r) ~ r _1_Q . The smaller values of r are irrelevant for 
networks of moderate size and the large values of r only 
describe the cut-off of the actual power-law. Thus, we 
have to examine the two cases: 



Pv{v) 



— ( — 

2N Pr \ 2NJ 



(26) 



for the domain of Eq. (25a) and 

Pv{v) * 2N~a Pr { 2N ) 

(27) 

for the domain of Eq. (25b) . This gives two asymptotes 
which intersect at v* = [affa + l)] 1 ^" -1 ^: 



i/- 1 -°(2jV)° 

Mr(a)r(i-«) 

M[r(a + l)]T(l-a) 



for ^ <v<v\ (28a) 
for v* < v. (28b) 



In Fig. 2 this is compared to simulation results with net- 
works of the order M = 10 3 , the size N = 10 5 , and the 
Levy-exponent a = 1/2. Eq. (28b) implies that for large 
degree expectation values v the pdf p v {v) follows the 



power-law: p v {y) 



regardless of the exact value of 



a. This v -behavior can be seen in Fig. 2. For a < 1/2 
the first regime does always exist because then 2Nr* < 



2N/M 2 = p < 1 < [aT(a + 1 



lV(a-l) 



(see Fig 2). For 



a > 1/2, the existence of the first regime depends on how 



sparse the network is. If [aT(a + l)] 1/( " 1} < 2N/M 1 / 01 , 
this regime is not visible, and the ^ _2 -behavior is ob- 
served over the whole domain of relevant v. The asymp- 
totics of degree distributions P{k) follows the asymp- 
totics of p v {v). 



IV. SUMMARY 

Many real world networks exhibit degree distributions 
which follows a power-law P(k) oc fc _1_a . The gener- 
alizations of classical models including intrinsic fitness 
parameters are considered to be applicable to generate 
such networks. 

We discussed in detail one of such models concentrat- 
ing especially on the domain of < a < 1 in which the 
corresponding distributions do not posses a mean. We 
first examined the situations in which the model is used 
to produce a multigraph Then we studied what happens 
if the multiple edges are merged to a single one and thus 
a simple graph is built. We gave the relation between the 
(normalized) fitness parameter r and the expected degree 
v of a vertex and showed that it possesses non-trivial 
intermediate and final asymptotic behaviors. Especially 
interesting is the fact that for large values of expected de- 
grees v the final asymptotics is universally p v (y) oc v~ 2 
and does not depend on a. This behavior is carried over 
to the actual degree distribution. 
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